SkipIndex: Towards a Scalable Peer-to-Peer Index Service for High Dimensional Data

نویسندگان

  • Chi Zhang
  • Arvind Krishnamurthy
  • Randolph Y. Wang
چکیده

Indexing of high-dimensional data is essential for building applications such as multimedia retrieval, data mining, and spatial databases. Traditional index structures rely on centralized processing. This approach does not scale with the rapidly increasing amount of application data available on massively distributed systems like the Internet. In this paper, we propose a distributed high-dimensional index structure based on peer-to-peer overlay routing. A new routing scheme is used to lookup data keys in the distributed index, which guarantees logarithmic lookup and maintenance cost, even in the face of skewed datasets. We propose a novel nearest neighbor (NN) query scheme that can substantially reduce search cost by sacrificing a small amount of precision. We propose a load-balancing mechanism that partitions the high dimensional search space in a balanced manner. We then analyze the performance of our proposed using a variety of metrics with simulation as well as a functional PlanetLab implementation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A peer-to-peer replica magement service for high-throughput Grids

Future high-throughput Grids may integrate millions or even billions of processing and data storage nodes. Services provided by the underlying Grid infrastructure may have to be able to scale to capacities not even imaginable today. In this paper we concentrate on one of the core components of the Data Grid architecture the Replica Location Service and evaluate a redesign of the system based on...

متن کامل

A Self-Organising Federation of Alchemi Desktop Grids

Desktop grids presents a next generation platform for aggregating the idle processing cycles of desktop computers. However, in order to efficiently harness the power of millions of desktop computers, the systems or middlewares that can support high level of efficiency, scalability, robustness and manageability are required. In this paper, we propose a scalable and self-organising desktop grid s...

متن کامل

BlueDove: A Scalable and Elastic Publish/Subscribe Service

The rapid growth of sense-and-respond applications and the emerging cloud computing model present a new challenge: providing publish/subscribe as a scalable and elastic cloud service. This paper presents BlueDove, an attribute-based pub/sub service that seeks to address such challenge. BlueDove uses one-hop look-up to organize servers into a scalable overlay. It proactively exploits skewness in...

متن کامل

A qualitative study of adolescents, parents and key informants\' experiences towards the importance of peer groups

Background and Purpose: One landmark of adolescence is the increasing value young people place on friendship and relationship with peers.The aim of this study was to determine adolescents and key informants’ perceptions and experiences regarding to importance of peer groups. Methods: Sixty-seven female adolescents (12–19 years) and 11 key informants, recruited from urban and rural areas from S...

متن کامل

P2P Network Trust Management Survey

Peer-to-peer applications (P2P) are no longer limited to home users, and start being accepted in academic and corporate environments. While file sharing and instant messaging applications are the most traditional examples, they are no longer the only ones benefiting from the potential advantages of P2P networks. For example, network file storage, data transmission, distributed computing, and co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004